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Abstract 

Gaussian noise stability results have recently played an important role in proving results in hardness 
of approximation in computer science and in the study of voting schemes in social choice. We prove a 
new Gaussian noise stability result generalizing an isoperimetric result by Borell on the heat kernel and 
derive as applications: 

• An optimality result for majority in the context of Condorcet voting. 

• A proof of a conjecture on "cosmic coin tossing" for low influence functions. 

We also discuss a Gaussian noise stability conjecture which may be viewed as a generalization of the 
"Double Bubble" theorem and show that it implies: 

• A proof of the "Plurality is Stablest Conjecture" . 

• That the Frieze- Jerrum SDP for MAX-q-CUT achieves the optimal approximation factor assuming 
the Unique Games Conjecture. 

1 Introduction 

Recent results in hardness of approximation in computer science [22, 9, 2, 3, 31, 14] and in the study of 
voting schemes in social choice [18, 2(i] crucially rely on Gaussian noise stabihty results. The first result in 
hardness of approximation established a tight inapproximability result for MAX-CUT assuming the Unique 
Games Conjecture [21] while the latest results conditionally achieve optimal inapproximation factors for very 
general families of constraint satisfaction problems [ ', ' ']. Results in social choice include optimality of the 
majority function among low influence functions in the context of Condorcet voting on 3 candidates [LS] 
and near optimality for any number of candidates [2(>]. A common feature of these results is the use of 
"Invariance Principles" [28, 29, 2()] together with an optimal Gaussian noise stability result by Borell [4]. 

In the current paper we prove a theorem generalizing the result of Borell [4], discuss a related conjecture 
and develop an extension of the invariance principle. As applications we derive some new results in hardness 
of approximation and social choice. In the introduction we state the theorem and the conjecture together 
with their applications and provide "moral support" for the correctness of the conjecture. 



1.1 Maximally Stable Gaussian Partitions 

We will be concerned with finding partitions of M" that maximize the probability that correlated Gaussian 
vectors remain within the same part. More specifically we would like to partition R" into q > 2 disjoint sets 
of fixed measure. 

Borell [-1] proved that when q — 2 and we have two standard Gaussian vectors with covariance p > in 
corresponding coordinates then half-spaces are optimal. Let /„ be the n x n identity matrix. For two n- 
dimensional random variables X = {Xi, . . . , Xn) and Y — (Yi, . . . , Yn) write Cov(X, Y) for the nxn matrix 
whose (i, j)'th entry is given by Co-v[Xi,Yj] = E[XiYj] - E[Xi] E[Yj]. Recall that X ^ N{0, In) means that 
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X is a standard rt-dimensional Gaussian vector with independent entries, all of which are standard normal 
random variables. Borell's result states the following: 

Theorem 1.1. [4] Fix p G [0, 1]. Suppose X^Y ^ N(0,/„) are jointly normal and Cav{X,Y) = pin- Then 
for any ^1,^2 CM", 

V{x e Ai, y e A2) < P{x e Hi,y e H2) 

where Hi = {x e W^\xi < aj for Oi chosen so that P(X G Hi) = P(X G Ai). 

We will consider two different generalizations of Theorem 1.1. The first generalization claims that half- 
spaces are still optimal if we have k > 2 correlated vectors and seek to maximize the probability that they 
all fall into the same part: 

Theorem 1.2 (Exchangeable Gaussians Theorem, or EGT). Fix p E [0, 1]. Suppose Xi, . . . ,Xk ^ N(0,/„) 
are jointly normal and Cov{Xi, Xj) = pin for i ^ j ■ Then, for any ^i, . . . , An C M", 

P(Vi : X, e Ai) < P(Vi : X, e H,) (1) 

where Hi = {x e IR"|a;i < a^} for Oi chosen so that P{X G Hi) = P{X e A^). 

We call the theorem above the Exchangeable Gaussians Theorem (EGT). Recall that a collection of 
random variables is exchangeable if its distribution is invariant under any permutation. 

The second generalization of Theorem 1.1 concerns the optimal partition of M" into q > 2 sets. We 
conjecture that when the partition is balanced (i.e. all q sets have equal measure) the optimal partition 
is a standard simplex partition dividing M" into q partitions depending on which of q maximally separated 
unit vectors are closest (ties may be broken arbitrarily), and further that this is the least stable partition for 
p < 0, even for unbalanced partitions. 

Definition 1.3. For n+1 > q > 2, Ai, . . . , Aq is a standard simplex partition o/R" if for all i 

Ai ^ {x eW^\x ■ > X ■ a^yj ^ i} 

where ai, . . .Oq € R" are q vectors satisfying 

«.-. = |„_l_ (2) 

When n > q a standard simplex partition can be formed by picking q orthonormal vectors ei, . . . ,eg, 
subtracting their mean and scaling appropriately, i.e. 




and for n = (7 — 1 it is enough to project these vectors onto the q — 1-dimensional space which they span. 
We call Ai, . . . ,Aq a balanced partition of M" if ^1, . . . , are disjoint with P{X e Aj) ~ i, Vj. 

Conjecture 1.4 (Standard Simplex Conjecture, or SSC). Fix p E [—1,1] and 3 < q < n + 1. Suppose 
X,Y ^ N(0,/„) are jointly normal and Cov{X,Y) = pin- Let Ai,. .. ,Aq C R" be a partition 0/ R" and 
Si, . . . , Sq E- R" a standard simplex partition. Then, 

i) If P ^ o.'i^d Ai, . . . , Aq is balanced, then 

P{{x, Y)eAlu---uAl)< P{{X, Y)eSfu---U S^) (3) 

ii) If p < 0, (3) holds in reverse: 

P{{x, Y) e Alu---uAl)> P{{X, Y)eSfu---u SD 

Conjecture 1.4 was made by the second author around 2004 during the writing of [''^]. 

When (7 = 3 the standard simplex partition, also known as the standard Y partition or the peace sign 
partition, is described in R^ by three half- lines meeting at an 120 degree angle at the origin (Figure 1.1) and 
in R", where n > 2, it can be exemplified by taking the Cartesian product of the peace sign partition and 
R"-2. 
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Figure 1: The peace sign partition 



1.2 Applications 

Given the numerous apphcations of the results of Borell together with invariance [22, 9, 2, 3, 31, 14, 18, 26], 
it is natural to expect that the generalizations discussed here will have a wide variety of applications. Here 
we derive the first applications in social choice theory and hardness of approximation in computer science: 

• From the EGT we derive certain optimality of majority in Condorcet voting. More specifically, majority 
asymptotically maximizes the probability of having a unique winner in Condorcet voting with any 
number of candidates among low influence voting schemes. It also maximizes the probability of k 
players agreeing, among low-infiuence functions, in the setting of cosmic coin flipping [27, 30]. 

• The SSC implies the Plurality is Stablest conjecture as well as showing that the Frieze- Jerrum [12] 
SDP relaxation obtains the optimal approximation ratio for MAX-q-CUT assuming the Unique Games 
Conjecture. 

The main tool for proving these applications is the invariance principle of [28, 29, 26] which we extend to 
handle general Lipschitz continuous functions. We note that previous work proved the invariance principle for 

functions and some specific Lipschitz continuous functions. The generalization of the invariance principle 
may be of independent interest. We proceed with formal statements of the applications. 

1.2.1 Condorcet voting 

Suppose n voters rank k candidates by each voter i providing a linear order at £ S{k) on the candidates. In 
Condorcet voting, the rankings are aggregated by deciding for each pair of candidates which one is preferred 
over the other by the n voters. This decision can be performed in many ways, but we will require that it 
satisfies two criteria 

• Independence of Irrelevant Alternatives (HA). The decision of whether a is preferred over b can only 
depend on each voter's preference between a and b. 

• Neutrality. The decision must be invariant under permutations on the k candidates. 

More formally, the aggregation results in a tournament Gk on the set [k] . Recall that Gk is a tournament 
on [k] if it is a directed graph on the vertex set [k] such that for all a,b £ [k] either {a > b) E Gk or 
(& > a) e Gk. Given individual rankings (cri)"=i the tournament Gk is defined as follows. Let 

■i{a) > C7,{b) j^^-^ b€[k]. 

Note that x''^" = — By IIA and neutrality we may assume that the binary decision between each pair 
of candidates is performed via an anti-symmetric function / : {—1, 1}" — > {0, 1} so that f{—x) = 1 — f{x) 
for all X G {—1, 1}". The tournament Gk — Gk{<J',f) is then defined by letting {a > b) G Gk if and only 
if f{x°'^^) = 1. A natural decision function is the majority function MAJ„ : {—1, 1}" {0, 1} defined by 
MAJ„(a;) = ^{Y."=^xi>G}- 

For the purposes of social choice, some tournaments make more sense than others. For example, it is 
desirable to have Gk provide a linear ranking of the candidates. But this is more than we can hope for, 
since by Arrows impossibility theorem [ ] a linear ranking can only be guaranteed when / is a dictator, i.e. 
f{x) = . A weaker requirement is that there exist a Condorcet winner or a unique best candidate in Gfc, 
i.e. for some a e [fc]: (a > 6) G Gk,yb ^ a. 

Following [ly, 1^, 2G], we consider the probability distribution over n voters, where the voters have 
independent preferences and each one chooses a ranking uniformly at random among all k\ orderings. Note 
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that the marginal distributions on vectors x""^^ is the uniform distribution over {—1, 1}" and that if / : 
{— 1, !}"■ — > {0, 1} is anti-symmetric then E[/] = i. The previous discussion and the foUowing definition are 
essentially taken from [2(i]. 

Definition 1.5. For any anti- symmetric f : {—1, 1}" —>■ {0, 1} let UniqueBest^(/) denote the event that the 
Condorcet voting system described above results in a unique best candidate and UniqueBestj.(/, i) the event 
that the i:th candidate is unique best. 

Consider first the case fc = 3. In this case has a unique best candidate if and only if it corresponds to 
a linear ranking of the candidates. Kalai [ ] studied the probability of a rational outcome (i.e. G3 linear) 
given that the n voters vote independently and at random from the 6 possible rational rankings. He showed 
that the probability of a rational outcome in this case may be expressed as 3 §i/3(/). Here Sp(/) denotes 
the noise stability §p(/) = 'E[f{X)f{Y)] where X is uniform on { — 1,1}" and Y is obtained from X by 
independently rerandomizing each coordinate with probability 1 — p. 

The influence Infi / of voter i, is the probability that voter i can change the outcome of the election, i.e. 
Inf,/ = P{f{X) ^ where is obtained from X by flipping the i:th coordinate. It is natural 

to ask which function / with small influences is most likely to produce a rational outcome. Instead of 
considering small influences, Kalai considered the essentially stronger assumption that / is monotone and 
"transitive-symmetric"; i.e., that for all 1 < i < j < n there exists a permutation a on [n] with a{i) — j such 
that f{xi, . . . ,Xn) — f{xa-(i), ■ ■ ■ ,Xa-{n)) for all {xi, . . . ,Xn)- Kalai conjectured that as 71 —> 00 the maximum 
of 3Si/3(/) among all transitive-symmetric functions approaches the same limit as lim„^oo 3 §i/3(MAJ„). 
This follows directly from the Majority is Stablest Theorem [2n, 2!)]. In [2(1] similar, but sub-optimal results 
were obtained for any value of k. More specifically it was shown that if one considers Condorcet voting on k 
candidates, then for all e > there exists r = r(fc, e) > such that if / : { — 1, 1}" {0, 1} is anti-symmetric 
and Infi(/) < r for all i, then 

P[UniqueBestj;(/)] < fc-i+°i-(i) + e. 
Moreover for the majority function we have Infj(MAJ„) ~ 0{n^^^^) and it holds that 

P[UniqueBestjMAJ„)] > k-'^-""'^^^ - o„(l). 
As a consequence of the EGT we provide tight results for every value of fc. 

Theorem 1.6. For any fc > 1 and e > there exists a T(e,k) > such that for any anti- symmetric 
f : { — 1, 1}" ^ {0, 1} satisfying max, Inf^ f < t, 

P[UniqueBestfe(/)] < lim P[UniqueBestfe(MAJ„)] -I- e 

n — *oo 

1.2.2 Cosmic coin flipping 

In the setting of cosmic coin flipping as studied in [27, 30], we have k players and a source X £ {—1, 1}" of 
n uniform bits. Each player i is given a noisy version Yi of X, where each bit in Yi is a noisy copy of the 
corresponding bit in X. More specifically, given X, each l^j- is selected independently as 

X, w.p. i±£ 
-X, w.p. V 

Note that ^[XjY^j] = p and ^%jYij] ^ p'^ for i ^ I. 

The fc players want to use their noisy versions of X to flip a balanced coin such that they all agree on 
an outcome with maximal probability, i.e. they want to select a balanced function / : {—1, 1}" — > {0, 1} 
maximizing 

V^^'^Hf) :=P(/(Fi) = ... = /(rfe)) 

The requirement that all players must use the same function should not be considered a restriction since, as 
shown in [ ' '], allowing each player to use a different function fi cannot increase the maximal probability of 
all players agreeing. 
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The main conjecture of [27] is that for any fixed fc, n and p, Vp [f) is maximized by MAJ„i for some 
m < n. As a consequence of the EGT we show that if / is required to have low influence in each coordinate, 
then majority asymptotically maximizes 7^p'^'"''(/) for unbounded n, 

Theorem 1.7. For any fc > 1, p G [0, 1] and e > there exists a T{e,k,p) > such that for any balanced 
f : { — 1, 1}" {0, 1} satisfying maxj Inf^ f < t, 

p(fc,n)(j)< lim 7'(^^"nMAJ„) + e 

1.2.3 Plurality is Stablest 

Consider an election with n voters choosing between q candidates. We call a function / : [q]" [q], 
which given the n votes determines the winning candidate, a social choice function. Letting Aq = {a; G 
M'ja; > 0,J2i=i^i — 1} denote the standard q-simplex, we generalize this notion a bit and call a function 
/ • assigning a probability distribution to the set of candidates a "fuzzy" social choice function. To 

be able to treat non-fuzzy social choice functions at the same time, we will usually embed their output into 
Aq and think of them as functions / : [g]" Eq, where Eg = {ei, . . . , e,} = {(1, 0, . . . , 0), . . . , (0, . . . , 0, 1)} 
are the q extreme points of A^ corresponding to assigning a probability mass 1 to one of the candidates. 

The noise stability of such functions measures the stability of the output when the votes are chosen 
independently and uniformly at random, and then rerandomized with probability 1 — p. 

Definition 1.8. For — < P < 1, the noise stability of f : [g]" M'"' is 

k 

§,(/) = 5]e[/,h/,(a)] 

where ui is uniformly selected from [q]" and each Xi is independently selected using the conditional distribution 

/i(A,|u;,)=pl{A,=.^,} + (l--°)J (4) 

Note that when / : [g]" is a non-fuzzy social choice function, Sp(/) = P{f{u!) = /(A)). 

We say that / : Aq is balanced if E[/([x))] = ^1 where uj is uniformly selected from and say 

that the influence of the i:th coordinate on a real valued function / : [q]" ^ M is 

Inf,/H=E[Var/H] (5) 

Note that the definition of Sp generalizes the definition for Boolean function in Section 1.2.1 if we identify 
[2]" with {—1, 1}". Likewise, the definition of influence here generalizes the notion of influence in Section 1.2.1 
except that the earlier notion is precisely 4 times larger. This distinction is usually not important, but when 
it is we will use the latter one given by (5). 

Let PLURn^g : [g]" Aq denote the plurality function which assigns a probability mass 1 to the 
candidate with the most votes (ties can be broken arbitrarily, e.g. by splitting the mass equally among the 
tied candidates). The Plurality is Stablest conjecture claims that plurality is essentially the most stable of 
all low-influence functions under uniform measure: 

Conjecture 1.9 (Plurality is Stablest). For any q > 2, p E [—-^pj, 1] and e > there exists a t > such 
that if f : [g]" Aq has Infi(/j) < r, Vi,j, then 

Sp(/) l£ 1™ Sp(PLUR„,g) + e if p > and f is balanced 

and 

Sp(/) > lim Sp(PLUR„,q) -e tfp<0 (6) 
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The case where q = 2, the Majority is stablest theorem, was proved in [2?)]. The question if Plurahty 
is Stablest or not has been discussed by Khot, Kindler, O'DonneU and the second author as part of their 
work [22]. Here we conjecture that Plurahty is indeed stablest. 

We show that the general case follows from SSC. 

Theorem 1.10. SSC (Conj. 1.4) Plurality is Stablest (Conj. 1.9) 

It should be pointed out that our results imply a slightly stronger result where the small influence 
requirement is replaced by a small low-degree influence requirement. This strengthening turns out to be 
crucial to applications in hardness of approximation. 

We also show the reverse implication for p > ^^ri' implying that the Plurality is Stablest conjecture is 
equivalent to the SSC for p in this range. 

Theorem 1.11. Plurality is Stablest (Conj. 1.9) ^ SSC (Conj. H) for p g [-^, 1] 

It follows from calculations in [ ] that the bound (6) in Conjecture 1.9 holds asymptotically for p — — 
as q — s- oo, i.e. 

§_^(/) > (1 - o,(l)) • lim §p(PLUR„,5) +e,(r) (7) 

g-i n — >oo 

where eqir) — > as r — s- 0. 

It may be helpful to think of the theorem in terms of a pure social choice function / : [g]" ^ [q\. In this 
case, there are n voters and each voter chooses one out of q possible candidates. Given individual choices 
Xi, . . . , Xn, the winning candidate is defined to be /(xi, . . . , a;„). In social choice theory it is natural to 
restrict attention to the class of low influence functions, where each individual voter has small effect on the 
outcome. We now consider the scenario where voters have independent and uniform preferences. Moreover, 
we assume that there is a problem with the voting machines so that each vote cast is rerandomized with 
probability \ — p. Denoting by Xi, . . . ,X„ the intended votes and Yi, . . . ,Yn the registered votes, it is 
natural to wonder how correlated are f{Xi, . . . , Xn) and f{Yi, . . . , Yn). Theorem 1.10 states that under 
SSC, the maximal amount of correlation is obtained for the plurality function if p > 0. The case where p < 
corresponds to the situation where the voting machine's rerandomization mechanism favors votes that differ 
from the original vote. In this case the theorem states that plurality will have the least correlation between 
the intended outcome f{Xi, . . . , Xn) and the registered outcome /(Yi, . . . ,Yn). In the next subsection we 
discuss applications of the result for hardness of approximation. 

1.2.4 Hardness of approximating MAX-q-CUT 

For NP-hard optimization problems in theoretical computer science research is conducted to find polynomial 
time approximation algorithms that are guaranteed to find a solution with value within a certain constant of 
the optimal value. Hardness of approximation results on the other hand bound the achievable approximation 
constants away from 1. For some problems, tight hardness results have been show where the bound matches 
the best known polynomial time approximation algorithm. For instance, Hastad [ ' '] showed that for MAX- 
ES-SAT one cannot improved upon the simple randomized algorithm picking assignments at random thus 
achieving an approximation ratio of | . 

In general, for constraint satisfaction problems (CSP's) where the object is to maximize the number of 
satisfied predicates selected from a set of allowed predicates and applied to a given set of variables, algorithms 
based on relaxations to semi-definite programming (SDP), first introduced by Goemans and Williamson [13] 
has proved very successful. 

Still optimal hardness results are not known for many CSP's. One promising direction forward is the 
Unique Games Conjecture (UGC), a strengthened form of the PGP Theorem introduced by Khot [20]. 
Although the UGC remains open, hardness results for many problems has since been proved under the 
assumption of the UGC, including optimal resuffs for MAX-CUT [22, 29] and VERTEX-COVER [.. ], and 
improved results for SPARSEST-CUT [ , ,]. Recently Raghavendra [32] showed tight hardness results for 
any MAX-CSP assuming the UGC, albeit without giving explicit optimal approximation constants. 

In Appendix A. 3 we consider one such problem that is known to be related to Plurality is Stablest. In the 
MAX-q-CUT or the Approximate q-Coloring problem, we are given a graph (possibly edge weighted) and we 
seek a g-coloring of the vertices that maximizes the number (or weight) of edges between differently colored 
vertices. 
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Definition 1.12. The weighted MAX-q-CUT problem, Aiq{V, E,w), is defined on a graph {V,E) with a 
weight function w : E ~* [0, 1] assigning a weight to each edge. A q-cut I : V [q] is a partition of the 
vertices into q parts. The value of a q-cut I is 

\KLl{Mq) = ^ Wl^u,v) 

The value of Mq is 

YAL{Mg) = maxVAL;(A^ J 



Frieze- Jerrum gave an explicit SDP relaxation of MAX-q-CUT (see Appendix A. 3) which was rounded 
using the standard simplex partition of Conjecture 1.4. In Appendix A. 3 we show that Conjecture 1.4 implies 
that this is optimal. 

Theorem 1.13. Assume Conjecture I.4 and the UGC. Then, for any e > there exist a polynomial time 
algorithm that approximates MAX-q-CUT within Uq — e while it is NP-hard to approximate MAX-q-CUT 
within Uq -\- 1. 
Here, 

q i-qi{p ) 
j^'<p<i q-i 1 - p 



1= >f „\ , (8) 



where ql{p) is the noise stability of a standard simplex partition Si, . . . , Sq of , i.e. 

qI{p) = PiiX,Y)eS'iU---US'g) 

where X,Y N(0,/q-i) are jointly normal with Cov(A, y) = piq-i. 

We note that a2 ~ 0.878567 is the Goemans- Williamson constant [ ]. It is conjectured that (8) attains 
it minimum at p = for > 3 (but not for k — 2). This was verified numerically in [ ] for 

= 3 . . . 10, where as, ... , aio were also computed. For instance, « 0.836008 and 04 « 0.857487. 

We further comment briefly on the results of [32]. Since MAX-q-CUT is an example of MAX-CSP with 
a single predicate, [V'] give an optimal approximation algorithm for MAX-q-CUT and an algorithm for 
computing the optimal approximation constant. However, the complexity of both these algorithms depends 
heavily on the precision e. In fact, the nmning time is doubly exponential in 1/e. In contrast our results 
(assuming the SSC) gives the optimal approximation constant as simple optimization problem in one variable. 



1.3 Support for the SSC 

To support the Standard Simplex Conjecture we first note that it is a natural extension of Theorem 1.1. 
Moreover, by Theorem 1.10 and 1.11 it is (for p e [^^^y 1 1]) equivalent to the Plurality is Stablest conjecture 
which is a natural extension of the Majority is Stablest theorem. By (7) this extension holds asymptotically 
as g ^ 00. In the limit as p ^ 1 further support is given by the Double Bubble Theorem in Gaussian space 
as we explain next. 



1.3.1 The Double Bubble Theorem 

The famous Double Bubble Theorem [Ki] determines the minimal area that encloses and separates two fixed 
volumes in R^. The optimal partition is given by two spheres which intersect at an 120 deg angle having a 
separating membrane in the plane of the intersection. The proof of this theorem is the culmination of a long 
line of work answering a conjecture which was open for more than a century. 

An analogous question can be asked in Gaussian space, R" equipped with a standard Gaussian density 
and the techniques and results used in the proof of the Double Bubble Theorem allow to find the partition of 
R"(n > 2) into three volumes each having Gaussian volume ^ minimizing the Gaussian surface area between 
the three volumes. Indeed, the results of [7] show that the optimal partition is the Peace Sign partition, 
which can be seen as the limit of the double bubble partition scaled up around one point on the intersection. 
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Figure 2: A double bubble in 

This indicates that the partition in Conjecture 1.4 is optimal (at least ioi q — 3 when p ^ 1). Indeed 
Conjecture 1.4 is stronger than the results of [7]. It is easy to see that Conjecture 1.4 with q — 3 imply 
that the Peace Sign Partition is optimal by taking the limit p ^ 1 (this is done similarly to the way in 
which Borell's result [4] implies the classical Gaussian isoperimetric result, see Ledoux's Saint-Flour lecture 
notes [Hi]). 

1.4 Organization 

In Section 2 we introduce the notation we use and various definitions and results from previous work, while 
also proving some useful properties of Gaussian noise stability. In Section 3 we describe the invariance 
principle which is used to relate certain questions about discrete noise stability to questions about Gaussian 
noise stability. Then, in Section 4 we prove the Exchangeable Gaussian Theorem and in Section 5 we prove 
a general noise stability bound for discrete low-influence functions. The following sections treats various 
applications. The first two applications are based on the EGT. In Section C we show that the majority 
function maximizes the probability of having a unique best winner in Condorcet voting and that majority 
is best for cosmic coin flipping among low-influence functions. The next two applications are based on the 
Standard Simplex Conjecture. In Section 7 we show that the Plurality is Stablest Conjecture follows from 
the SSC - and essentially is equivalent to the SSC. Based on the results of Section 7, we also include a proof 
in Appendix A of the optimality of the Frieze-Jerrum SDP for approximating MAX-q-CUT given the SSC 
(and assuming the Unique Games Conjecture). 

2 Preliminaries 

In this section we introduce some notation and recall various definitions and results from [29, 2G]. Further- 
more, we derive some useful properties of Gaussian noise stability in section 2.7. 

2.1 Conventions 

To make it more clear whether we are working with functions on discrete space / : [g]" ^ M*^ or functions on 
continuous Gaussian space g : M" M.'' wc will usually use / to denote discrete functions and g to denote 
continuous functions. 

For a discrete function we will write E/ for F,f{uj) where uj is uniformly selected from [g]" and II/II2 = 
E(/(w), f{Lu)) and similarly for a continuous function we will write E g for Eg{X) where X ^ N(0, /„) and 
= E{g{X),g{X)). We also say that g e if Wgh < 00. 

2.2 Multilinear polynomials 

Consider a product probability space (f^,/i) = (Iir^i ^i^]Xi=i f^i)- ^'^^^ be interested in functions / : 
nr=i ^» — *■ K on such spaces. For simplicity, we will assume that each /i^ as full support, i.e. fJ,i{u!i) > 
OjVwi e Qi. Then clearly, for each coordinate i we can create a (possibly orthonormal) basis of the form 

Xi = (Xj^o = 1, ^1,1, ■ • • , (9) 
where E[Xij] — for j > 1, for the space of functions Hi M. 

Definition 2.1. We call a finite sequence of (orthonormal) real-valued random variables where the first 
variable is the constant 1 and the other variables have zero mean an ( orthonormal) ensemble. 
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Thus, X ~ [Xi, . . . is an independent sequence of (possibly orthonormal) ensembles. We will only 
be concerned with independent sequences of ensembles, however we will not always require the ensembles to 
be orthonormal Another type of ensembles are the Gaussian ensembles, of which an independent sequence 
is typically denoted by Z = (Zi, . . . , Z„) where Z^ — {Zifi — 1, Zi i, . . . , Zi^mi) and each Zij is a standard 
Gaussian variable. 

Definition 2.2. A multi-index a is a sequence of numbers (cti, . . . ,crn) such that a i > 0,Vi. The degree \(t\ of 
cr is \{i G [n] : Gi > 0}|. Given a set of indeterminates {xi_j}i^\^n],o<i<mij 1st = OlLi "^i^^i- A multilinear 
'polynomial over such a set of indeterminates is an expression Q{x) — CaXcr where Co- G K are constants. 

Continuing from (9) and letting X^^ — JliLi should be clear that {Xc} forms a basis for functions 

nr=i ^« ^ hence any function / : Y[i=i ^ ^ can be expressed as a multilinear polynomial Q over X: 

/K,...,c^„) = g(A'i,...,A'„) = ^c,X, (10) 

Definition 2.3. The degree of a multilinear polynomial Q is 

deg Q = max \a\ 

We will also use the notation Q-'^ to denote the truncated multilinear polynomial 

CT:|cr|<d 

and the analogous for Q^'^ and Q^'^ . 

Definition 2.4. Given a multilinear polynomial Q over an independent sequence of ensembles X = {Xi, . . . , Xn), 
the influence of the i:th coordinate on Q{X) is 

Inf,; Q{X) - E [Var[QiX)\Xi, . . .,X,_i,X,+i, . . . A"™]] 

We also define the d-degree influence of the i:th coordinate as 

InfPQ(A') =lnf,Q^'^(A') 

Note that neither the degree nor influences of Q{X) depends on the actual basis selected in (9), hence 
we can write deg / = deg Q, Infi / = Infi Q{X) and Inf^'' / = Infi Q-''-{X). 

2.3 Bonami-Beckner noise 

Let us first define the Bonami-Beckner noise operator. 

Definition 2.5. Let (f2, /i) ~ (n"=i n"=i /^«)- a finite product probability space and a the minimum 
probability of any atom in any Qi. For < p < 1 the Bonami-Beckner operator on functions f : 

nr=i ~^ defined by 

Tpf{i^i, . . . ,w„) = E[/(Ai, . . . , A„)|tJi, . . . w„] 
where each Xi is independently selected from the conditional distribution 

fii{Xi\uJi) = /3l{Ai=a;,} + (1 - P)t^l{^l) 

For p g [0, 1] this is equivalent to Tpf being the expected value of / when each coordinate independently 
is rerandomizcd with probability 1 — p. 
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2.4 Orthonormal ensembles 



Most of the time we will work with orthonormal ensembles. Using independence and linearity of expectation 
it is easy to see that if Q{X) — Ca^X^ is a multilinear polynomial over an independent sequence of 
orthonormal ensembles, then 



cl 



E[Q(A')] = CO ; Var[Q(A')] = ^ ; Inf,Q(A') = ^ 

a:\cr\>0 cr:o-i>0 

cr a rcri>0 

'^'\\o\<d 

Combining these expressions it is also easy to see that Inf^*^ / is convex in / and satisfies the following 
bound on the sum of low-degree influences: 

n 

Elnfr''/<dVar/ (12) 

i=l 

2.5 Vector- valued functions 

Since we will work extensively with vector-valued functions wc make the following definitions: 
Definition 2.6. For a vector-valued junction f = (/i, . . . , fk), let 

k k 



Var / = E ' * / = E ' 

3=1 J=l 



and similarly for Inf^''. 



Thus (12) holds even for vector-valued /. Also, all expressions in (11) hold for vector-valued multilinear 
polynomials Q{X) = CaX^, where Ca G and X is an independent sequence of orthonormal ensembles, 
if we replace cl with \\c„\\l and i:,[Q{Xf] by ||(3(A')|||. 

Finally, by expressing functions / : [5]" ^ M.^ under the uniform measure on the input space [g]" as a 
multilinear polynomial 

n 

/(^) = E "^"^ II ^^^'^i ^^^^ 

(7 i—1 

this lets us express the noise stability of Definition 1.8 as 

§p(/) = E[(/,T,/)]=E'^'"'l|C'^ll2 (13) 

a 

2.6 Correlated probability spaces 

It will be important for us to bound the effect of the Bonami-Beckner noise operator on functions on correlated 
probability spaces. 

Definition 2.7. Let {Qi x f22,/i) be a correlated probability space. The correlation between f2i and Q2 with 
respect to fi is then 

p(17i,f]2;/i) = sup Cov(/i(wi), 72(^2)) 

/i:Oi^R,Var/i = l 



For {Qi X ■ ■ ■ X Clii., fi) we let 



p{Qi, . . . ,fifc;^) = max p rij,TTr2j;^ 

l<i<k \ I 
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The following lemma shows that the expected value of products of functions where corresponding coordi- 
nates form correlated probability spaces does not change by much when some small noise is applied to each 
coordinate: 



Lemma 2.8. [ , Lemma 6.2] Let (JlILi IlILi /^«) finite product probability space where fJ^ = 

(f2^, . . . , r2*^) are correlated probability spaces with p{nj , . . . , il^ ; fii) < p < 1. For j = l...k, let = 
(Xf , . . . , X,-[) be an independent sequence of orthonormal ensembles such that X^ forms a basis for functions 
57^ R and Qi, . . . , multilinear polynomials bounded by \Qj{X^)\ < 1. Then, for all e > there exists a 
7 = 7(e, p) > such that 

k k 

E Y[ Q, {X^ ) - E J] Ti_^Q, {X^ ) 



< e-k 



To verify the assumption p{^}, ■ ■ ■ , ilf ; pi) < 1 the following lemma is useful: 

Lemma 2.9. [ , Lemma 2.9] Let (f2i x ^2,p) be a correlated probability space such that p{ll!i,lu2) > a or 
p{lui,uj2) — for all uJi,uj2- Define a bipartite graph G — U 0.2, E) where (a, b) £ E if p{a, b) > 0. Then, 
if G is connected, then 

pini,n2;p) < l-aV2 



2.7 Gaussian noise stability 

Definition 2.10. For p G [—1, 1], the Ornstein-Uhlenbeck operator Up is defined on functions g : M" M'^ 
such that g G L^ , where X ~ N(0, by 



Upgix) - E g{px + ^1 - p^O 



where ^ N(0, /„). 



It is easy to see that if Z = (Zi, . . . , Z„) is a Gaussian sequence of independent ensembles and Q{Z) 
X^cr Ccr-Zo-, then 



Thus Up and Tp acts identically on multi-linear polynomials over Gaussian sequences of independent ensem- 
bles. 

Analogous to the expression (13) of discrete noise stability in terms of the Bonami-Beckner operator, we 
define the Gaussian noise stability in terms of the Ornstein-Uhlenbeck operator, 



Definition 2.11. For any g 



Ag, let 



Sp{g)^E[{g,Upg)] 



Note that we use the same notation Sp for both discrete and Gaussian noise stability. The intended kind of 
noise should always be clear from the context. 

A convenient property of the Ornstein-Uhlenbeck operator is that it creates continuous functions. The 
following result is well known: 

Lemma 2.12. For any p e (—1, 1) and g : M" — > A^, Upg{x) is continuous in x. 
Proof. 

\\Upg(x) - Upg{y)\\2 = \\ Eg{U) - Eg{V)\\2 
where U ^ px + i/l - p^^^ , V ^ py + i/l ^ p'^S.y and £,y ~ N(0, /„). 
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First note that if X G N(/i, 1) and Y E N(— /i, 1), then the total variation distance between X and Y is 

1 



2V2tt 
1 



e 2 _ e 2 



E 

zeN(o,i 



1 



2V27r 



Hence, 



and 



dTv(Ui,Vi) = dri^ 



If. _ „£l±M T/. _ „ Xi+y, \ 



2 (xj-yj)^ 



i=l 

Since we can couple U and F such that they are equal except with probability dTv{U, V), we have || Eg{U) — 
Eg(F)||2 — > as ||x - y\\2 ^ as needed. □ 

Also, applying some small noise will not affect the noise stability much, 

Lemma 2.13. For p e [-1, 1] and g : M" Ag, 

Sp{Ui-tg) — > Sp(5) as e — > 



Proof. 

|§p(C/i_,5)-§p(5)| = |E({/i_,.g,C/p[/i_,5>-E(g,[/p5>|- 

= |E(C/i_,.g - 5, C/p[/i-e5> + E(g, C/pC/i_, - f/pff) | < 

< \\Ui-,g~g\\2 + \\Ui-eUp-Upg\\2 

where the inequality follows from Cauchy-Schwarz and commutativity of Up and Ui^e- By ['^, Theorem 
4.20], if g : M" ^ R, g e then 

Ui-^g ^ (7 in as e ^ 

Clearly, this extends to vector-valued functions as well, hence the result follows. □ 

Analogous to the discrete setting we say that g : M" A, is balanced if ^[g{X)] = il for X ~ N(0, 1). 
The following lemma shows for any fuzzy partition a non- fuzzy partition with almost the same expectation 
and noise stability (as measured in Theorem 1.2 and Conjecture 1.4) can be created. 

Lemma 2.14. Fix p E ^kZj: 1 o.iT-d Qa < Q- Suppose Xi, . . . ,Xk ^ N(0,/„) and Co\-{Xi, Xj) ~ pin for 
i ^ j ■ Then, for any e > and gi, . . . , gk '■ M" Aq, there exist hi, . . . ,hk '■ M" — ^ Eq such that 



^|E%,-Eg,-,| <ge ,Vj 



(14) 



i=l 



qo k qo k 

ee n ^^■■'(^^■) - ee n siAx,) 
j=i i=i j=i 



< e 



(15) 
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Proof. Assume first that p G ^—^7^,1^ so that the normal distribution is non-degenerate. Discretize R" 

with cubes [0,(5)", i.e. write M" = (5Z" x [0,(5)". where (5Z" denotes the n-dimensional integer lattice scaled 
by a factor S. 

Let Zi_j — S so that Zi denotes the cube Xi is in, and let Uij be i.i.d. uniform on [0, 6], independent 

of Xi,...Xk. 

Further let t] be the density of {Xi, . . . , Xk) and fj the density of {Zi + Ui, . . . , Z^ + Uk)- By continuity 
of 77 we have pointwise convergence, 

fi{x) — > rj{x) as (5 — > 



By dominated convergence, this implies that we can choose S so that 



\r]{x) - ?~]{x)\ dx<- 



Hence, for any / : 



[0, 1], we have 



J {x)rj{x)dx 



f {x)fj{x)dx 



< 



f{x) \r]{x) - fi{x)\ dx < 



(16) 



Each non-fuzzy function hj is constructed from gj by transferring masses internally in each cube. More 
specifically, hj is defined arbitrarily on each cube with the only restriction that 

E[/i,(Zi + Ui)\Zi] = E[g,(Zi + Ui)\Zi] 



(For instance, if F,[h{Zi + Ui)\Zi = zi] = ^, then we may divide the cube zi -I- [0,(5)" into q parts of 
conditional measure ^i, . . . and assign the value ei, . . . , Cg respectively to each part.) Thus, 



.,^U,)\Z,] = 



qo k qo k 

eE n + c^i) = eE n ng^Zj 

1=1 j = l i=l j = l 

qo k qo k 

= eE n nhjAz, + u,)\z,] = eE n '^jv.C^, + u,) 

i=l j = l 4=1 j = l 

Applying (16) twice gives (15). Similarly 

Eg,-,(Zi + Ui) = E[E[5j-,(Zi + Ui)\Z{\] = E[E[%,(Zi + Ui)\Zi]] ^Eh.^Z^ + Ui) 

and two more applications of (16) gives | 'Eigj^Xi) — 'Eihj^Xi)] < e and (14) follows. 

The two degenerate cases can be handled in a similar way by using a density with respect to a lower 
dimensional Lebesgue measure. □ 

We also need a simple result that states that, for instance, almost balanced functions cannot be much 
more stable than balanced functions: 



Lemma 2.15. Fix p G [— jpY, 1],. 9o < 9 o^nd pi, . . . , pk € R''- Suppose Xi, . 
normal with Cav{Xi, Xj) = pin for i ^ j ■ Let gi, . . . ,gk : M" Eg with 



,Xk ^ N(0,/„) are jointly 



1=1 



Then, there exist /ii. 



— > Eq with E hj — pj such that 

qo k qo k 

i—l j—1 i—1 — 1 



k 



Proof. Clearly, it is enough to change the value of gj on a set of Gaussian measure ^ (and such sets can 
easily be find since the Gaussian density is continuous). Thus, we can create a function hj with E/ij = pj 
such that P{gj{Xj) ^ hj{Xj)) — ^, and the result follows by the union bound. □ 
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3 An invariance principle 



Let / : Yli=i ^ K be a function on a finite product probability space and express it as a multilinear 
polynomial Q{X) over an independent sequence of orthonormal ensembles as in (10). The invariance principle 
of [29] (see also earlier results in [-i')]), shows that if Q has low degree and each coordinate has small influence 
then the distribution of Q{X) does not change by much if we replace the variables Xij with independent 
standard Gaussians Zij. 

In [ ] the invariance principle was extended to the case of vector- valued functions / = (/i, . . . , fk) where 
fj : n"=i ^ K for each j. 

Theorem 3.1. ([^^ ], Theorem 4-1 md 3.16) Let (11"=! n"=i Mi) finite product probability space, 
a > the minimum probability of any atom in any fii and X = {Xi, . . . ,X„) an independent sequence of 
orthonormal ensembles such that Xi is a basis for functions ili M. Let Q be a k-dimensional multilinear 
polynomial such that Yar Qj{X) < 1, degQj < d and Inti Qj{X) < r. Finally, let ^ : MJ^ ^ R be a 
function with I*''') | < B /or |r| = 3. Then, 

|E*(0(A')) - E*(Q(Z))| < 2dBfc3 = 0{V^) 

where Z is an independent sequence of standard Gaussian ensembles. 

As suggested in [ , Corollary 4.3], since neither Var Qj{X), degQj or IniiQj depend on whether the 
ensembles are orthonormal, we can simply replace the orthonormal requirement by a matching covariance 
structure requirement. 

Definition 3.2. We say that two independent sequences of ensembles X — {Xi , . . . , <%"„) and y = (3^i , . . . , 3^n) 
have a matching covariance structure if for all i, \Xi\ — \yi\ and 'E[X*Xi] ~ E[3^|3^,;]. 

Theorem 3.3. Let X = (Xi, . . . , Xn) be an independent sequence of ensembles, such that P{Xi = x) > 
a > 0,yi,x. Let Q be a k-dimensional multilinear polynomial such that Yar Qj{X) < 1, degQj < d and 
Inf, QjiX) < T. Finally, let : M.'' ^ R be a function with < B for |r| = 3. Then, 

|E *(Q(A')) - E *(Q(Z))| < 2dBk^ {S/V^Y - 0{V^) 

where Z is an independent sequence of Gaussian ensembles with the same covariance structure as X . 

Proof. For each i, let fi^ be the tr-algebra generated by the variables in Xi. Since a > 0, fi^ is finite, hence we 
can find an orthonormal ensemble X- which is a basis for $7^ — > R and a linear transformation Ai such that 
Xi — XI Ai. Let Z' be any standard Gaussian ensemble and Zi — Z-Ai. Then Z has the same covariance 
structure as X. Let Q' be the multihnear polynomial defined by Q'{X') — Q[X[Ai, . . . ,X!^An). The result 
then follows by applying Theorem 3.1 to Q'{X') while noting that it has the same variances, degrees and 
influences as Q{X). □ 

For our applications we will need a version of Theorem 3.3 for functions ^ which are not functions. 
Instead we will assume that ^I^ is Lipschitz continuous with Lipschitz constant A, i.e. |^'(a;) — ^'(y)| < 
A\\x-yh. 

Theorem 3.4. Let X = [Xi,...,Xn) be an independent sequence of ensembles, such that P(Ai ~ x) > 
a > 0,yi,x. Let Q be a k-dimensional multilinear polynomial such that Yar Qj{X) < 1, degQj < d and 
Infi Qj{X) < T. Finally, let ^' : M''' M. be Lipschitz continuous with Lipschitz constant A. Then, 

|E ^iQ{X)) - E vf (Q(Z))| < DkA (d (8/^/^)' V^) - ©(r^/e) 

where Z is an independent sequence of Gaussian ensembles with the same covariance structure as X and Dk 
are universal constants. 

To prove Theorem 3.4 we need the following lemma which assures that Lipschitz continuous functions 
can be approximated well by C'^ functions. 
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Lemma 3.5. Suppose 



is Lipschitz continuous, i.e. \^ [x) — {y)\ < Ajlx — y||2 for some constant 



A> 0. Then, for all \ > there exists a C°° function 
i) |*(a;) - '^x{x)\ < AX 

n) \^^^{x)\ < 4^ 
where Br,k o,i"e universal constants. 

Proof. Let /i denote the Lebesgue measure on R*^ and let 
defined by 



such that Va; G 



id Vr : Irl = r > 1, 



be the k-dimcnsional bump function 



= J Ce 
1 



"2 if 11x112 <1 
else 



where the constant C is chosen so that /^ggt (j){x)fi{dx) — 1. It is well-known that is C°° with bounded 
derivatives, hence there exist constants Br < oo such that sup^ 

Let *A = ^^fZ-A, i-e. 



For A > 0, let c^xix) - ^0(f ). Then ^A(a;)A*(da;) - 1 and < ^ 

*A(a;) = / 0A(x-t)'J'(t)^(di) 

"'||K-t||2<A 

By the mean value theorem, ^aI-t) = *(^), for some t : \\x - t\\2 < A. But \^{t) - ^{x)\ < A\\x ~ t\\2 < AX, 
which proves i). 

Without loss of generality we may assume that r = ei + r2, where ei — (1,0,..., 0)* is the first unit vector. 
Since 5* is bounded on ||a; — t\\2 < A, ^'a is C°° and for any s. 



II a; — ill 2< A 



<P^^\x-t)^{t)^iidt) 



Thus we may write 



^i^\x) 



A /' cl,^;^\x~t)^{tMdt) 



X — t\\2<^ 



d 

dxi 



lim 



lim 



t||2<A 



(b^^'\t)^ix-tUdt) 



t||2<A 



0A W r Kdt) 



^(r2),.^ (^(a^ + ^el-t)-^(3;-t)) 

0A (*) Z l^(dt) 

t\\o<X " 



< 



< lim 

J||t||2<A 

Br 



(*(.T + hei -t)- ^{x - t)) 



H{dt) < 



Br -I 
Xr-1 



Taking Br^k = Br-i2^ proves ii). 

Proof of Theorem 3.4. Let 5'a be the approximation given by Lemma 3.5. Then, 

|E«'(Q(A'))-E*(g(Z))| < |E*A(g(A'))-E*A(Q(2))|+2AA< 

2Ae d 
< ^ + 2AX , where e = dSa.fc/c^ (S/V") 

where we have used Theorem 3.3. Picking A = e^/^ and letting = AkB^J^ gives the result. 



□ 



□ 
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Our final version of the invariance principle replaces the bounded degree requirement with a smoothness 
requirement which can be achieved by applying the Bonami-Beckner operator Ti-^y on Q{X) for some small 
7 > 0. Later we will use Lemma 2.8 to show that this smoothing is essentially harmless for our applications. 

Theorem 3.6. Let X — {Xi, . . . , Xn) he an independent sequence of ensembles, such that 'P{Xi — x) > a > 
0,yi,x. Fix 7,r e (0,1) and let Q be a k-dimensional multilinear polynomial such that 'Var Qj(X) < 1, 
Var Qf^ < (1 - t)^'' and Ini, Qf'^iX) < t, where d = log i/ log i . Finally, let * : M*^' ^ M &e Lipschitz 
continuous with Lipschitz constant A. Then, 

|E*(Q(A'))-E*(Q(Z))| <CfeAT*/i°si 



where Z is an independent sequence of Gaussian ensembles with the same covariance structure as X and Ck 
is a constant depending only on k. 

To prove Theorem 3.6 we need following easy lemma which bounds the effect of small deviations on 
Lipschitz continuous functions. 

Lemma 3.7. Suppose 5* : M*"' R is Lipschitz continuous, i.e. \'i> (x) — 'i/ {y)\ < A\\x — y\\2 for some constant 
A> Q. Then, for all random variables X,^ taking values in M'^, 



|E*(X + -E«'(X)| < A ^Ee 
Proof. 

|E*(X + 0-E*(X)| <E\-ii{X + ^) -^{X)\ <EA\\i\\ 



2 



1/2 / V 1/2 



C2 



□ 



Proof of Theorem 3.6. The proof is by truncation of Q at degree d = log log i. Without loss of 
generality we may assume that a < \ (else, all random variables are constants and the result is trivial). 
Using Lemma 3.7 twice (with ^ = Q^'^{X) and ^ — Q^'^{Z) respectively) and noting that Theorem 3.4 holds 
for all positive real values on d we find, 

|E*(g(A'))-E^'(Q(Z))| < \¥^-^{Q^'\X))-E-^{Q^'\Z))\+2AVk{\--fY 

< DkA (16/V^''/^ ri/*^ + 2AVke-''^ 

The result now follows by noting that 
and 

(16/V^)'/'t1/« = e^'°s^ri/6 = r"^i°g^/i°giri/6< 

< ^-Jj^l/6 ^ < ^^/logi 

where both inequalities uses that a < \ and the last also that 7 < 1. □ 



3.1 Projective Lipschitz functions 

In our applications the test function can be decomposed into a projection fc onto some compact convex 
subset C C R*-' and a function a Lipschitz continuous function C ^ M. The projection fc : K'' M*^ is 
defined by f{x) being the unique point y Cz C which minimizes |jx — y\\2. The following standard lemma 
states that such projections are always Lipschitz. 
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Lemma 3.8. Let C C R"^ he a compact convex subset. Then fc is well- defined and Lipschitz continuous 
with Lipschitz constant 1. 

Proof. Let us first establish that fc is well-defined. Fix a; € R*^. By compactness, there exists a y which 
achieves inf^gc \\x — y\\2- For uniqueness, suppose y' also achieves this, i.e. \\x — y\\2 — \\x — y'\\2- By 
convexity of C, y* := G C . Still \x — y*\^ = \\x — y\\2 — ^\\y — y'\\2- But since y minimizes ||x — y\\2 we 
must have y — y'. 

Let us now turn to Lipschitz continuity. Fix a;, y e R'^. We need to show that \\fc{y)'~fc{x)\\2 < Hy — a^lb- 
If fc{x) = fc{y) we are done. Otherwise, let L denote the line passing through fc{x) and fciv) and let 
XL and yL denote the orthogonal projection of x and y onto L. Clearly, \\xl — J/ilh < \\x — y\\2- By 
convexity, the intersection Cl ~ C O L oi C and the line is a segment of i. It remains to show that 
||/(x) — f{y)\\2 < \\xl — yL\\2- But this is easy to see by considering three cases depending on whether Ci 
and the segment [f{x),f{y)] of the line L are disjoint, one is contained in the other or they only partially 
overlap. □ 



4 Proof of the Exchangeable Gaussians Theorem 

In this section we prove the EGT, Theorem 1.2. Our starting point will be the extended Riesz inequality on 
the sphere [5, 25]. Let S™^^ C R™ be the m—l -dimensional sphere of radius 1 in R™ and for any Borel 
measurable set A C S™~^, define its spherical rearrangement A* with respect to a point x* e S™^^ as the 
spherical cap centered at x* with the same measure as A, i.e. A* = {x : ||x — x* II2 < a} for a chosen so that 
A and A* has the same measure. 

Theorem 4.1. [r>, Theorem 3] Fixm>l and for Ai, . . . , Ak e B(S™^i) let 

k 

i=l l<i<j<k 

where Ki,j : R+ — > R+ are non-increasing functions and Xi, . . . Xk are i.i.d. uniform on S'"^"'^. Then, for 
any Ai, . '. . , ^Ifc G 1(8™-!), 

JKiAi,...,Ak) < JKiAl,...,Al) 



where A^, . . . A^ are the spherical rearrangements of Ai, . . . , Ak with respect to some fixed point x* € S™ ^ . 

We will prove a slightly more general version of Theorem 1.2 allowing for more general kinds of noise in 
each dimension of the k Gaussian vectors, and different and possibly non-balanced sets for each vector. In 
the rest of this section we will think of the vectors Xi, . . . Xk as being column vectors in a matrix X , and 
we will write X,i for the i'th row vector of X . 

Definition 4.2. Let E e M'^^'^ be positive definite. Then the Gaussian E-noise stability of Ai, . . . , Ak E 

B(M") IS 

. . . , Afe) = P(Xi e Ai, . . . , Xfe e Afe) 

where X,i, X,n are i.i.d. N(0, E). 

We also let fj, — §[1] denote the standard Gaussian measure on R". 

We will first prove a corresponding result on the sphere from which Theorem 1.2 can be derived based 
on Poincares observation that Gaussian measure on R" is obtained by projection of the uniform measure on 
gm-i Qj^^Q as m ^ 00. Let us first define spherical H-noise stability. 

Definition 4.3. Let S e R'^^'^ be positive definite. Then the spherical S-noise stability of Ai, . . . , A^ € 

B(S"-i) IS 

S^iA,,...,Ak)^P{X,eA,,...,XkeAk) 

where X,i, . . . , X,„i o,re i.i.d. N(0, E) and Xi — jxf][^- 

We also let fi = S[i] denote the uniform measure on the sphere S™~^. 



17 



Theorem 4.4. Let E G 



fkxk 



be 



positive definite with (S ^"^^ j — ^ f'^^ * 7^ J- Then, for any Ai, . . . , G 
^^{Ai,...,Ak)<MHi,...,Hk) 



where Hi = {x € S"* |a;i < ai} for ai chosen so that /i(-ffi) = fL{Ai). 



Proof. 



§s(Ai,...,Afe) = P{XieAi,...,XkeAk 



m 



dx 



1=1 



where C — (2''7r'''|E|) Changing the order of summation in the exponential we get 



:iA,,...,Ak) ^ C 1 



'dx 



l<i<j<k 



CiE 



^ IIZ1II2 '^^l---' ||Zfc|l2 ^'^^ 



, n 



l<i<j<fe 



where Zi,. . . ,Zk are independent with Zi e N(0, -^^^^^ — ^m) and Ci = nj=i(^ Conditioning 
on the lengths of the Z^'s we have 



Ss(Ai,...,Afe) = CiE 



E 



l^il 



(17) 



i<i<j<fc 

The inner conditional expectation can be expressed (almost surely with respect to the measure on the lengths) 



as 



E 



^{ZieAi,...,Zfc6Afc} 11 ^ 



l<i<j<k 



(18) 



where Zi, . . . ,Zk are i.i.d uniform on S™ ^ . But since Zi ■ Zj is decreasing in \\Zi — Zj\\ and (S ^)-<0 
for i 7^ j, Theorem 4.1 implies that replacing each Ai with A* in (18) will not decrease the value of (18) and 
hence not the value of (17). Thus, §e(Ai, . . . , A-) < §e(^I , • • • , ^fc) ^ S^^Hi, . . . , Hk). □ 

We are now ready to prove the more general version of Theorem 1.2 allowing for more general noise as 
well as for sets of arbitrary fixed measure. 

Theorem 4.5. Let E e R''^'^ be positive definite with (E^^)^. . < for i 7^ j . Then, for any Ai, . . . , A^ E 

B(M"), 

MAi,...,Ak)<§^{Hi,...,Hk) 
where Hi = {x G M"|a;i < a^} for a,; chosen so that ^{Hi) — /i(Ai). 

Proof. For fixed m> n, let X,i, . . . , X,m be i.i.d. N(0, E) and X^ = 

Further, let Yi = {Xi^i, . . . ,Xi,n) denote the restriction of Xi to the first n coordinates, and similarly 
Yi = {Xi^i, . . . ,Xi^n). Then, the central limit theorem implies 



(Fi, ...,Yk)^{Yu...,Yk) (^sm^oo 
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Suppose first that Ai, . . . ,Ak are closed sets. Then, by [1 ] , Theorem 2.4], 

hmsupP ((?!,..., no e n - P I (^1' ■■■^Yk)eY[AA 

""^^ \ ^=l J \ i=l J 

i.e. _ 

limsupSE(Ai X R'"-", . . . ,^fe X R™"") < ■ ■ • ,^fc) (19) 

m — >cxD 

if on the other hand has a boundary of zero measure, so a similar apphcation of [ , Theorem 2.4] gives 

hmsup§s(ffi X R™-", . . . , iJfc X R'"-") = Ss(i/i, • ■ ■ , Hu) (20) 

m — >oo 

But by Theorem 4.4, 

Ss(Ai X R™~", . . . , ^fe X R™-") < §s(-ffi X R™-", ...,HkX R™-") (21) 

hence. Combining (19), (20), (21) gives the result for closed Ai, . . . , Ak- 

If not all Ai^s are closed, regularity of the uniform measure jl implies that for all e > and i £ [k] there 
exist closed A\ C Ai such that ^i{A'^) > /i(Ai) — e, and hence §s(^'i, ■ ■ • , A'j,) > §s(^i, ■ ■ • , Ak) — ke, and the 
result follows from the result for closed sets by letting e ^ 0. □ 

The EGT now follows as a special case of Theorem 4.5, 

Proof of Theorem 1.2. Take S^.j — p + [1 — p)5ij in Theorem 4.5 and note that the inverse of E is given by 
(S-i)^ ^. ^-a + b5,j, where 6 = and a (i-p)(i+p(fc^i)) > for p > 0. □ 

Remark: As pointed out to us by an anonymous referee, the special case of the EGT where all sets Ai 
are identical (and all pairs Xi, Xj have the same covariance p > 0) also follows from Borell's result [4]. See 
also [2*1, Theorem 4.1]. 

5 Exchangeable low influence bounds 

Combining the EGT and the invariance principle allows us to derive stability bounds on discrete low-influence 
functions. In this section we derive a general bound on the stability of discrete low- influence functions which 
is used for the applications in the next section. 

Letting S(X) denote the cr-algebra generated by X we show. 

Theorem 5.1. Fix p g [0, 1] and let Xi, . . . ,Xk € f^" be random column vectors such that the row vectors 
X.i, . . . , X,n are i.i.d. with p(E (Xi^i) , . . . , E (^fe,i) ; P) = p' < 1 and Xi, . . . , Xk are pairwise p-correlated 
in that for any ji ^ j2 , 

n 

P(X,, = X\X,, = ^) = n (/'IfA.^-.} + (1 - P) P(^,„« = AO) (22) 

1=1 

Then, for any e > there exists a T(e,k,p') > such that for any fi,...,fk ■ f^" ~^ [0? 1] satisfying 
maxij Infi fj < t, 

k 

En/j(^j-)<P(Vj:^, <aj) + e (23) 

where . . . , Zk ^ N(0, 1) are jointly normal with Cov(Z,:, Zj) = p for i ^ j , and each aj is chosen so that 
P{Z,<a,) = Ef,{Xj). 
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Proof. Let m = — 1, M = nm and Gi, . . . ,Gk G N(0, Im) be jointly normal with Cov(Gj^ , Gj^) = pIm 
for ji ^32- 

Note that all variables Xj,i have the same marginal measure. Thus, we can fix an orthonormal basis 
V(x) = {Vq{x) = 1, Vi(x), . . . , Vm{x)} for functions ^ R under this marginal measure and form orthonor- 
mal ensembles 

Xl = (1, Vi{Xj^i), VmiXj^,)) , for i e [n] and j € [k] , and 

Si = (1, ■ • • , , for i e [n] and j G [fc] 

and independent sequences of orthonormal ensembles 

= {X(, ...,X^) and 5^" = [Gi, for j S [fc] 

Then A'-' is a basis for all real- valued functions on Xj and we can compute the (unique) multilinear polynomial 
Qj such that fj{Xj) — Qj{X^). Hence we may write. 



k k 



(24) 



For each j let Qj ~ Ti^^Qj be a slightly smoothed version of Qj. Since p' < 1, by Lemma 2.8 we can find 
a 7 = 7(e, /c, p') > such that 



^llQjiX^)--El[Q,iX^) 

3=1 i=i 



< 



2k 



(25) 



Let fio.i]{x) = max(0, min(l, x)). Since Qj has range [0, 1], the same holds for Qj. Hence, for all j, 

QjiX^) = f[o,i]QA^') (26) 

Now form new ensembles, 

X, = (1, Vi{Xi,,), VmiXl^^), Vl{Xk,^). . . . , VmiXk,^)) ioT I G [n] , and 



Gi — (1, Gi^rn{i-1) + 1, ■ • ■ i Gi^rn(i-l) 



1 . . . , Gk^m(i-i)+m) for i G [n] 



and note that X = {Xi, . . . , Xn) and = {Gi, ■ • ■ , ^/n) are two independent sequences of ensembles with a 
matching covariance structure, since by (22), for ji ^ j2 G [fc], 

E[Vi^{Xj^^i)Vi^{Xj^^i)] = pl{i^=i^} = E[Gj^_,„(j_i)+/^Gj2_„i(j_i)+/J 

Further, lidiQj{X) = lidi f{Xj). Hence, we may apply the invariance principle (Theorem 3.6) on the 
/c-dimensional multilinear polynomial 

Q{X)^Q{Xi,...,Xr.) = [Qi{X')...Qk{X'')f 



using ^'(xi, . . . , Xk) = 11^=1 /[o.i] i^j) which is Lipschitz by convexity of [0, 1] , Lemma 3.8 and the product 
Yl'j=i being Lipschitz on [0, 1]*'^^. Thus, by Theorem 3.6, there exist some r > such that. 



k k 



< 



(27) 



Now (/[o.ijQ, 1 — /[o.i]Q) applied to G-' can be thought of as a function R" — > A2 creating a fuzzy partition 
of the n-dimensional Gaussian space. Let p,j = F,fj{Xj) ~ E/jo_i]Qj(A'^). Then a second application of 
Theorem 3.6 with \E'(x) = /[o,i](a;) gives 



Mi 



< 



20 



By Lemma 2.14 and 2.15, there exist functions gi,. . . ,gk ■ E2 with Eigj^i{Gj) — /ij and 

k k 

4fc 



E n flo.iMSn < E n 9jAGj) + ^ (28) 
But any such gj partitions R^^ into 2 parts of of measure /ij and 1 — fij respectively, so Theorem 1.2 impUes 

k 

E n 9jAS') < P(Vj : G, e Hj) = P(Vj : Z, < a,) (29) 
i=i 

where Hj ^ {x e R*^|a;i < a^}. Combining equations (24), (25), (26), (27), (28) and (29) gives (23) as 
needed. □ 

6 Applications of the EGT 

In this section we show the two appUcations of the EGT in Condorcet voting and Cosmic coin flipping using 
the influence bounds proved in Section 5. 

6.1 Condorcet voting 

Here we use Theorem 1.2 to show that majority maximizes the probabiUty of having a unique best candidate 
in Condorcet voting (Theorem 1.6). 

Remember that we have n voters selecting a linear order ai G S{k) uniformly at random and let 

Xr'={ \ > -^(^) ,for^GWanda,6GW. 

By considering the 6 possible linear orders of three candidates its easy to see that for any distinct a, 6, c G [k] 
we have 

E[X^>'>] = , VarXp'' = 1 and Cov[Xp^Xp"] = i 

First we will show that the limit of the probability of having a unique best candidate using the majority 
function corresponds to the right hand side of (1). 

Lemma 6.1. Let Z2, . . . , Zk ^ N(0, /„) be jointly normal with Cav{Zi. Zj) = i/„ for i j ■ Then 

hm P[UniqueBest^.(MAJ„)] = P{Z2 E H,. . . ,Zk E H) 

n — >-oo 

where H = {x e W'\xi < 0}. 

Proof. Let Y, = % definition 1.5, 

P[UniqueBestfe(MAJ„)] = P{Y2 > 0, . . . , Yfe > 0) 

But, E[y,] = 0, E[r/] = 1 and Cow[Y,,Yj] = | for i ^ j. Thus, by the central limit theorem, {Y2, . . . , Yfe) ^ 
{Z2, ■ ■ ■ , Zk) and the result follows. □ 

Proof of Theorem 1.6. Clearly, any candidate has the same probability of being the unique best candidate. 
So it's enough to show that the probability that the first candidate is the unique best is maximized by 
majority, i.e. for some r small enough, 

P[UniqueBestj.(/, 1)] < lim P[UniqueBestfc(MAJ„, 1)] + 

n^oo k 

But, 

P[UniqueBestfc(/, 1)] = V{f{X^>^) = . . . - /(Xi>'=) = 1) 
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Let 

To see that p{k) < 1 it is by symmetry enough to show that {Xl>'^) , S (X/>^ . . . , Xl>^) ; P) < 1. But 
this follows by Lemma 2.9, since the bipartite graph of Lemma 2.9 is complete and any edge has a probability 
of at least y since it occurs in at least one ordering. 

Hence, by applying Theorem 5.1 on the vectors . . . ,X^^^ using / as /i, . . . ,/fc we can find a 

T = T(e, k) such that 

P(/(Xi>2) = . . . = /(Xi>'=) = 1) < P(Vi : e i?) + e 
Lemma 6.1 now gives the result. □ 

6.2 Cosmic coin flipping 

Here we use Theorem 1.2 to show that majority maximizes the probability of all players agreeing in cosmic 
coin flipping (Theorem 1.7). Remember that we want to maximize 

:=P(/(Yi) = ... = /(rfe)) 

where each Yi is uniform on { — 1, 1}" and Cov[Fi, Yj] ~ p^In for i ^ j. 

First we will show that the limit of the probability of all players agreeing when using the majority function 
corresponds to twice the right hand side of (1). 

Lemma 6.2. Let Zi, . . . , Zk ^ N(0, /„) be jointly normal with Cav{Zi, Zj) ~ p^In for i ^ j ■ Then 

lim 7'^'='"HmAJ„) = 2P(V? : Z, e H) 

where H ^ {x e W^\xi < 0}. 

Proof. Let X, = ^ Er=i Then, 

T'f ^")(MAJ„) = P(Xi >0,...,Xk>0) + P(Xi <0,...,Xk<0) 

But, 'E[Xj] — 0, E[X|] = 1 and Co-v[Xi, Xj] = for i ^ j. Thus, by the central limit theorem, 
{Xi, . . . , Xk) {Zi,i, . . . , Zks) and the result follows. □ 
Proof of Theorem 1.7. The theorem is trivial for p ~ 1. So assume p G [0, 1). To see that 

p(s(ri),...,E(rfe);P) < 1 

it is by symmetry enough to show that p(S (Yi) , JJ^^^ S (Y, ) ; P) < 1. But this follows fr om Lemma 2.9, 
since every value of Y occurs with non-zero probability and hence the bipartite graph of Lemma 2.9 is 
connected and by finiteness, the minimal probability of an edge depend only on k and p. Hence, by applying 
Theorem 5.1 on the variables Yi, . . . , Yfe twice, first using / as /i, . . . , and then using 1 — /, we can find 
a, T — T(e, k, p) such that 

k k 

= E n fiYj) + E - fiYj)) < 2 P(Vj ■.Z,eH) + e 
Lemma 6.2 now gives the result. □ 



7 Plurality is stablest 

Here we show that Conjecture 1.4 implies the Plurality is stablest conjecture (Theorem 1.10). 

We start by showing an unconditional result that asserts that the most stable low low-degree influence 
functions are essentially determined by most stable partition of Gaussian space into q parts of equal measure. 
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Theorem 7.1. For any q > 2, p £ [^^^i 1] a?^^ e > there exist d and r > such that *//:[?]" ^ 
has Inf^'^(/j) < T, Vz,j, </ien t/iere exists a g : W^^''^^^ — )■ Eq such that Eg = E f and 

|§p(.9)-Sp(/)| <e 

Definition 7.2. for q > 2, let /a, : K'' — > Ag denote the function which maps x to the point in Ag which 
is closest to X. 

Proof of Theorem 7.1. The result is trivial for p = 1 so assume p E [—^^,1). Let (fl x A, p.), with the 

p-correlated measure p{lj, A) = pl{\=uj}^ + (1 ^ p)'^ be our base space and let (w, A) G [(?]" x [q]" be drawn 
from /Lt". 

Fix an orthonormal basis V(x) = {Vb(a;) = 1, Vi{x), . . . , Vq-i{x)} for functions [q] M and construct 
two sequences of orthonormal ensembles X = {Xi, . . . ,Xn} and y — {yi, . . . ,yn} for functions ^ R and 
A ^ R by letting Xij{uji) — Vj{uji) and Yi,j{Xi) = Vj{Xi). Note that this means that 

Expressing f{xi, . . . , Xn) as a q-dimensional multi-linear polynomial Q{V{xi), . . . , V(a;„)) we get 

§p(/) = ^E[/,H/,(A)] - ^E[Q,(A')Q,(3;)] (30) 

i=l i=l 

Let Q = Ti-^Q be a slightly smoothed version of Q. To show that p{Vl, A; /i) < 1, observe that p(r2, A; /i) 
is given by the supremum of E[/i(a;)/2(A)] over all /i : R and /2 : A ^ R such that Efi ~ and 

Eff^l. But E[/i(w)/2(A)] = E[/i(w)(p/2(co) + (1 - p) E/2)] = pE[/i(w)/2(co)] < |p| by Cauchy-Schwarz. 
Hence, p{il, A; ^) < jpl < 1 and by Lemma 2.8 we can find a 7(5, p, g) > s.t. 

nQAx)QAy)] - Eio^-w^cv)]! < ^ (31) 

Since Q(A') has range A,, the same holds for Q{X). Hence, 

/a,Q(A') = Q{X) (32) 

(and similarly for y). We are now ready to apply the invariance principle (Theorem 3.6) using ^{x,y) — 
{f\q{x), /a, (2/))- To see that 4'(x,?/) is Lipschitz continuous note that /a^ is Lipschitz by convexity of A^ 
and Lemma 3.8, and the inner product {x,y) = X^iLi is Lipschitz on A^. Hence Theorem 3.6 implies 
that for some r > small enough, 

E[(/A,Q(A'),/A,g(3^))] -E[(/A,Q(e),/A,Q(W))]| < ^ (33) 

where Q and 7i are two Gaussian sequences of orthonormal ensembles with 

Ja^Q applied to Q or H can be thought of as a function R"'^^^^ — > Ag creating a fuzzy partition of the 
n{q— l)-dimensional Gaussian space. The balance of this partition might not equal the balance of / though. 
In particular, 

E / = EQ{x) = E/A,g(A') ^ E/A,g(e) 

But applying Theorem 3.6 again, using 'I'(x) — f\^j{x) which by Lemma 3.8 is Lipschitz continuous with 
A — \^ we can bound the total variation distance by 

ElE/A„,Q(t/)-E/A„,Q(A')| <g^ = | 
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Hence, by Lemma 2.14 and 2.15 there exists a function g : R"('? Eq such that E g = E / and 



E[(/A,g(a),/A,g(H))]-E[(5(^),g(H))] <- (34) 

But 'Ei[{g{Q),g{H))] = Sp{g), hence combining equations (30), (31), (32), (33) and (34) gives the desired 
result. □ 

In order to prove Theorem 1.10 we first show that the limit of the noise stabihty of PLURn, q corresponds 
to the right hand side of (3). 

Lemma 7.3. Fix p e [-^, 1] and q>3. Let X,Y ^ N(0,/,_i) and Cov(X, F) = piq-i. Then 

lim §p(PLUR„.,) = P((x, r) e ^2 ^ • • • u si) 

where Si, . . . ,Sq is a standard simplex partition o/M'^^^. 

Proof. Let Si, . . . , Sq be the standard simplex partition determined by the unit vectors ai, . . . , G R'~^ 
according to Definition 1.3. By Definition 1.8, 

§p(PLUR„,,) = E(PLUR„,g(c^),PLUR„,,(A)) 
where to, X are uniform on [q]" and satisfy (4). Let 



Then, conditioning on having no ties which will happen with probability 1 as n — > oo, we have 
PLUR„,,(w) = J ^ [Vi ^ J : Viio) ■ a, > Vico) ■ a,] ^ y(w) G S*,- 

§p(PLUR„,,) = P[{V{uj), V{X)) eSfu---U S^q] (35) 



and 
Now, 
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E\a^] = 0, since Efa^.l • a, = 1 - (g - 1) — - — = 

q — 1 

EK.a^J = —^Iq~i, since E[a„,a^ ] • = Oj + Ofe — = — '—aj 
g — 1 1 ./ J g — 1 g — 1 

EK.aJj = E[a^, E[ajjt^i]] = E[/9a^,a5'j = p^-^Vi, 

Hence, by the central limit theorem, (V{uj),V{X)) converges to a normal distribution with the same 
parameters as {X,Y). Thus, 

lim P{{V{lj), V{X)) eSfu---US^)= P{{X, Y) e Sf U ■ ■ ■ U Sl) 

which together with (35) gives the result. □ 

Proof of Theorem 1.10. Fix q > 2, p ^ 1] a-nd e > and let r and d be the constants given by 

Theorem 7.1. For any / : [g]" — > Ag with Inf^''(/j) < r, Vi,j, we can thus find a g : M"'^^^) such that 
Eg = E/ and | ^p{g) — Sp(/)| < £• But Conjecture 1.4 and Lemma 7.3 implies that 

§n(5) < 1™ Sp(PLUR„ „) if p > and g is balanced 

and 

Sp(<?) > lim §p(PLUR„,5) if p < 
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which gives the slightly stronger result that only requires Inf^'' fj small. That we may replace this low 
low-degree influence requirement with the simpler low influence requirement follows by noting that 

InfP(/,) < Inf,(/,) 

□ 



7.1 From Discrete to Continuous 

We have shown that the SSC implies the Plurality is Stablest Conjecture. We now show that the reverse is 
also true for p > ~^3Y' thereby establishing the equivalence of the Plurality is Stablest Conjecture and the 
SSC for p in this range. 

Theorem 7.4. For any q > 2, n > 1, p £ [^^ZTfi 1], e > 0, t > and g : M" Eg there exist an m and an 
f : [g]™ — > Aq with Infi(/j) < T,yi,j such that E / = Eg and 

\Spif)-Sp{g)\<e (36) 



Proof. Let g — Ui-sg for some small J > be a smooth version of g. By Lemma 2.13 we can pick S small 
enough so that 



\Sp{g)~Spm < 



(37) 



Let m = rn for some r to be determined later and lu uniform on [g]™ and AG [g]"* selected according to (4). 
For i = 1 . . . n, let 



V^{UJ) 



and define / by /(w) = g{V,{u})). By the central limit theorem {V{u}),V{\)) ^ {X,Y) as r ^ oo where 
X^Y has the distribution of Conjecture 1.4. Since g is bounded and continuous (Lemma 2.12) we have 

Sp(/)-§p(ff) =|E(5(y(A)),ff(y(c.)))-E(5(X),5(r))|^0asr^oo (38) 



/ might not have the same balance as but by changing its value on at most ^ X]j=i E fj — cjj points we 
can create an / such that E / = E g (= E g) and 



§p(/V§p(/)| <Eh/.■-E5, 



as r — > oo 



(39) 



Picking r large enough and combining (37), (38) and (39) gives (36). 

It remains to show that the influences of / can be made small. But this follows from g being Lipschitz; 
changing only one variable, say a;(i_i)m+i cannot change V{uj) by more than ^ , so 



Inf /j = E[ Var fj\ < A J-^ ^ < t 



for r large enough. 



□ 



Proof of Theorem 1.11. This follows by combining Theorem 7.4 and Lemma 7.3 and letting the e of Theorem 
7.4 go to 0. □ 



8 Conclusion 

In this paper we have demonstrated the relationship between optimally noise stable low-influence partitions 
of discrete space and optimally noise stable partitions of Gaussian space. In particular we have applied this 
relationship to various problems in social choice theory and hardness of approximation in computer science. 
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Of the two generalizations of Theorem 1.1 considered we have proved one. The other one remains an open 
problem. We also note that the two directions of generalizations can be combined yielding a more general 
conjecture stating that the standard simplex partition of M" into q > 2 parts maximizes the probability that 
k > 2 positively correlated Gaussians fall into the same part. 

It should be noted that our results give a direct correspondence between discrete low-influence noise 
stability and Gaussian noise stability. That is, even if the SSC is false, whatever the most stable partition 
of Gaussian space is, it can be used to construct a most stable low-influence balanced social choice function. 
Moreover, as long as the least stable partition of Gaussian space does not depend on p when p < 0, it will 
give an exphcit optimal UGC hardness result for MAX-q-CUT. 
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A Approximability of MAX-q-CUT 

In this appendix we show that if we assume the Unique Games Conjecture, then the optimal approximability 
constant of MAX-q-CUT is directly related to the most stable partition of Gaussian space into q parts of equal 
measure as described in Conjecture 1.4, and together these two conjectures implies that the Frieze- Jerrum 
SDP achieves the optimal approximation ratio. 



A.l The Unique Games Conjecture 

The Unique Games Conjecture (UGC) was introduced by Khot in [ ]. It asserts the hardness of approxi- 
mating the Unique Label Cover problem within any constant. 

Definition A.l. The Unique Label Cover problem, C — {V, W, E, M, {<7v,w}(v,w)£e)j defined on a bipartite 
graph [V U W, E) with a permutation cr„ : [M] — > [M] associated with every edge {v,w) d E ^ V x W . A 
labeling I : V UW ^ [M] is said to satisfy an edge {v, w) if 

cr(v,w){l{w)) = l{v) 

The value of a labeling I, VAL;(/3), is the fraction of edges satisfied by I and the value of C is the maximal 
fraction of edges satisfied by any labeling, 

YAL{C) = maxVAL,(/:) 



Conjecture A. 2. The Unique Games Conjecture. For any 77,7 > there exists a M = M{rj,^) such 
that it is NP-hard to distinguish instances C of the Unique Label Cover problem with label set size M having 
VAL(>C) > 1 ^ rj from those having VAL(£) < 7. 



A. 2 Optimal approximability constants 

Next, we will show that for any e > 0, MAX-q-CUT can be approximated within — e in polynomial time 
while it is UG-hard to approximate it within Pq + e where the constants aq and Pq are given by. 

Definition A. 3. For q> 1, let 

aq = lim sup inf ^-^9) 



and 

Pq^ lim inf sup V ^ , (41) 



q 1 - §p(g ) 

jq — 11111 1111 oLijj 

ri^oo_^<p<l g q—l l—p 



where the supremum is over all g : R" — s- i?, 
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Note that the limit in (40) and (41) exist since they are hmits of bounded functions increasing with n (we 
can always ignore any number of dimensions while specifying the partition) . 

We now show that a, = /Sq assuming Conjecture 1.4. To do this, we first show that we can restrict 
attention to non-positive values of p and for all such values the standard simplex partition is optimal. 

Lemma A. 4. Fix q -.M" Eg. Then inf - — ^uMl is obtained by p = 0. 

0<p<l 1-p ^' 

Proof. Fix e > 0. By Theorem 7.4 there is an / : [g]™'"' A, with E.g = E / such that 

On the other hand, by (13) and (11) 

Sp(/) - E 11^-112 < II E/ll^ + P(ll/ll^ - II E/ll^) <p+{l- p)|l E/||2 



Hence, 

1 - §p(/) 
1-p 

and by the construction of /, 



>1-||E/| 



'-^^>l-\\^a\\l (42) 
Letting e — > 0, and noting that (42) holds with equality for e = p ~ gives the result. □ 
Theorem A. 5. Assume Conjecture 1.4- Then cxq = [3q. 

Proof. By Lemma A. 4 the infimums in the definition of aq and fiq are obtained for — < P < 0. The 
result now follows from the fact that for p in this range, the least stable partition in Conjecture 1.4 does not 
depend on p. □ 

We now proceed to present the approximation algorithm and the inapproximability argument which 
together implies Theorem 1.13. 

A. 3 An approximation algorithm 

The approximation algorithm presented here is a generalization of the algorithm presented in [!_'] allowing 
for an arbitrary partition to be used when rounding the relaxed solution. The algorithm in [12] corresponds 
exactly to using the simplex partition of Conjecture 1.4, which (as we will see) is optimal if Conjecture 1.4 
is true. 

Let ai, . . . , flq e W^^^ be generating vectors of a standard simplex partition of M''^^, i.e. satisfying (2). 
Labeling the vertices with vectors from ai , . . . instead of numbers from [q] , we can write the value of a 
MAX-q-CUT instance Alq(y, w) as the following discrete optimization problem: 

\kL{Mq)= max ^ E(«,„)6_b " ^« • 

subject to lu £ {ai, . . . , aq} ,\fu Q V 

To obtain the SDP relaxation we allow the vectors to be arbitrary points on the unit sphere S""^^ while 
adding the constraint ■ > ~~:y '^hich by (2) holds for vectors in {ai, . . . , Og}, 

SDP-VAL(X,) := max Eiu,v)eE W(u,v)i^ - Zu ■ z^) 

subject to Zu e M",Vu e V 
Zu - Zu = l,Vu e V 
Zu - Zy > -^,Vii,w e V 

where n = \V\ denotes the number of vertices. 

The rounding applied to the solution of SDP-VAL is parametrized by an integer m, a partition A = 
{Ai, . . . , Aq} of M'" and an error constant 6 > 0, 
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Approximation algorithm Tl{m,A,5). 

1. Compute an almost optimal solution (z„)„gv to SDP-VAL{M.q) using semidefinite programming. This 
will achieve a value of SDP-VAL(Mq) — 6. 

2. Pick a projection matrix T : M™^", by letting Tij be i.i.d. N(0, 1). 

3. For each u d V , let l(u) = i iff Tzu G Ai. 

Let R-VAL(Alg) = VAL;(A^g) be the value of the rounded labeling. Then, the expected approximation 
ratio is: 

E[R-VAL(Xg)] ^ E[R-VAL(.Mg)] _ J2iu,v)eE^(.u,v)P {l{u) Ijv)) ^ 
VAL(A1,) - SDP-VAL(A1 ,) + <5 " ^ Eiu,v)eE (1 - z„ • z.) + <5 " 

g l-P{{Tzu,Tz„)eAlu---UAl) 
> mi — 

^— q-l 

But, Tzu,Tzv G N(0,/m) and Cov{Tzu,Tzv) — {zu • Zv)Im, so by picking m large enough and Ai, . . . ,Aq 
so that the limit in (40) is almost achieved (bar, say S), and then picking d = (5(e) small enough, we get an 
approximation ratio of ag — e, for any e > 0. We have proved the following result 

Theorem A. 6. For any e > there exists a polynomial time algorithm that approximates MAX-q-CUT 
within aq~e. 

A. 4 Inapproximability results 

We will now prove that MAX-q-CUT is UG-hard to approximate within any factor greater than (3q. To do 
so, we present a reduction from the Unique Label Cover problem to MAX-q-CUT following the same outline 
as the corresponding reduction for MAX-CUT given in [2'_!]. The reduction is based on a Probabilistically 
Checkable Proof (PCP) whose proof H consists of the function tables of {fw}w£W, where f^ ■ [q]^^ [q] is 
expected to be the long code of w's label l{w), i.e. fw{x) — x^^y In order to be able to reduce the PCP 
to MAX-q-CUT, the PCP verifier Vp is designed to use an acceptance predicate which reads two random 
function values from the proof and accepts iff they differ. Thus, a MAX-q-CUT instance Aiq can be created 
from the PCP by letting the vertices be the function values that can be read by Vp, the edges the pairs of 
function values that are compared, and the weights the probability of that comparison being made by Vp. 
The verifier is parametrized hy p G [— , 1] . 

PCP Verifier Vp. 

1. Pick V Cz V at random and two of its neighbors w, w' at random. 

2. Pick X E [g]^^ at random. 

3. Pick y G [q]^'^ to be a p-correlated copy of x, i.e. each yi is independently selected using the conditional 
distribution ^ 

KVtlXi) = + (1 - P)- 

4. Accept if f^P^^^^{x) ^ fw'Pa^,,^,{y) 

where P„ : [q\^^ — s- [q]*^ denotes the function Pa{xi, . . . ,xm) = {xa(i), ■ ■ ■ :Xa{M))- 

Using a result from [.!■!] we can assume that the graph is regular on the V side so that (w, w), and similarly 
(w, w'), picked by Vp corresponds to a an edge selected uniformly at random. 
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Lemma A. 7. (Completeness). Fix p G i^'fijj !]• Then, for any Unique Label Cover problem £ with 
VAL(>C) > 1 — ?7 there exists a proof H such that 

P[Vp accepts H] > (1 - 277)^^(1 - p) 
Proof. Let / be the optimal assignment for C and f^ be the long code of l{w), i.e. 

fw (^) — ^l{w) 

With probability at least 1 — 2ri, both edges {v,w) and {v,w') are satisfied by I. In this case, 
fwPa^.^X^) = ^a^,^(i{w)) = a;i(t,) and fw'Pa^^^Av) = yi{v) 
and Vp accepts with probability 

P[xi{v) + yi(v)\ ^i-(^p + = ^^^^ ~ 

□ 

Lemma A. 8. (Soundness). Fix p G [—-^^,1] and e > 0. Then, there exists a -f = '-f{q,p,e) > such 
that for any Unique Label Cover problem C with VAL(£) < 7 and any proof II, 

P[Vp accepts H] < 1 - A^(p) + e (43) 

where 

A-{p) = lim inf Sp(.g) 

^ n— >oo g:R"— 

Proo/. For w e T4^, let : [9]*^ ^ defined by 

lw{x) = e/^„(^) 

map the value of fw onto one of q unit vectors, and for v &V, let g„ : [q]^^ Aq be defined by 

5,(x) = E[/^P,_(x)] 
where the expectation is over a random neighbor w of v. Then, 

P[Vp accepts n] = E [1 - (/^Pa_ (2:), /^'P._, (z/)>] = 

V ,w ,w' .,x ,y 

= 1- E [(5,(x),<7,(y))] = l-E§p(5,) 

Now suppose n is a proof such that (43) is not satisfied, i.e, 

ESp(g„) < A-(p)-e (44) 

We need to show that this implies VAL(£) > 7. To do so it is enough to create a random labeling I such 

that 

E[VALi(£)] >7 

Let Vgood = {v G V\§p{g„) < A-{p) - §}. Since §p(g„) > 0, (44) implies that |Vgood| > f Further, 
for 1; G Vgood, Theorem 7.1 implies that max^ Inf,^'^ gy > r, for some d and r > depending only on q,p and 
e. 

The assignment / is created as follows: 
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1. For V E V, let l{v) — i, where i maximizes Inif'^g^ (ties broken arbitrarily) 

2. For w G W, let l{w) = i with probability proportional to Inf^'' /„,. 
Since (12) holds for vector-valued functions, this means that 



qd 

For V e Fgood, 

r < Infg) 5. = Infg) E[/^,P,„,„ (x)] < E Infg, /^P<,^„„ (, 

= Elnf^l /^(x) < gd P [;(«;) = a-^(/(«))] = P satisfies {v,w)] 

where the second inequality follows from convexity of Inf^'^. Hence, 

E[YAU{C)] ^ P (l satisfies {v,w)) > ^ ■ 
I i,v,w 2 qd 

Picking 7 = f • ^ > finishes the proof. □ 

Together, the soundness and completeness lemmas implies the following inapproximability result for 
MAX-q-CUT: 

Theorem A. 9. For any e > it is UG-hard to approximate MAX-q-CUT within (3q + e. 

Proof. By Lemma A. 7 and A. 8 it is UG-hard to distinguish instances of MAX-q-CUT with value at least 
(1 — 2?7)^^(1 — p) from instances with value at most 1 — A^(p) + e for any 77, e > 0. Thus, it is UG-hard to 
approximate MAX-q-CUT within 

l-A-(p) + e _ q l-A-(p) , 



(l-2r;)^(l-p) q-l l-p 

where e' > can be made arbitrarily small by picking 77 and e small enough. Since this holds for any 
p € TTY, 1] the result follows. □ 
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